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Abstract — An information theoretic perspective on group test- 
ing problems has recently been proposed by Atia and Saligrama, 
in order to characterise the optimal number of tests. Their 
results hold in the noiseless case, where only false positives 
occur, and where only false negatives occur. We extend their 
results to a model containing both false positives and false 
negatives, developing simple information theoretic bounds on the 
number of tests required. Based on these bounds, we obtain an 
improved order of convergence in the case of false negatives only. 
Since these results are based on (computationally infeasible) joint 
typicality decoding, we propose a belief propagation algorithm 
for the detection of defective items and compare its actual 
performance to the theoretical bounds. 

I. Introduction and Problem Outline 

The idea of group testing was introduced during World War 
II in order to reduce the cost of large scale blood tests by 
pooling blood samples together [5 1. Since then, it emerged as a 
promising approach in various applications, including multiple 
access communications and DNA clone library screening (cf. 
J6) and references therein). 

The advent of compressed sensing (CS) has revived interest 
in group testing Q, as both problems involve the detection of a 
sparse high-dimensional signal via a small number of random 
measurements. However, the compressed sensing literature 
has mostly focussed on problems with measurement matrices 
with entries taken from distributions with densities. Group 
testing naturally belongs in a broader framework of discrete 
compressed sensing, where the entries are random integers, 
often just Os and Is. This framework of discrete compressed 
sensing includes wider applications such as genotyping 0. An 
extension of the group testing problem to the scenario where 
pools must conform to the constraints imposed by a graph has 
also been studied recently (4). 

An information theoretic approach to a noisy version of 
group testing was recently developed by Atia and Saligrama 
ID, 0. We adopt much of their model and notation, which we 
will first briefly review. We will use group testing to identify K 
defective items within a larger collection of N items, by testing 
a pool of items at a time. Each test reveals whether the pool 
contains any defective items, i.e., the test result is positive, or 
1, if at least one of the items in the pool is defective, and it 
is otherwise negative, or 0. However, we will allow two types 
of errors to occur in the testing. 



1) False positives, where the test result is positive with 
probability q when the pool does not contain any defec- 
tive items. In other words, the result of the test is ORed 
with Bernoulli(g) random variable. 

2) False negatives - the indicator whether an item is 
defective is "diluted" with probability u. In other words, 
the result of the test will only be positive if the indicator 
of some defective item passes through a Z-channel 
successfully. 

Note that the false negatives make the analysis significantly 
more complicated than for standard coding theoretic problems. 
This is because this model makes the noise dependent on the 
input, because a pool with more defective items will be less 
likely to return a false negative. Interestingly, our results here 
indicate that false negatives are, in a certain sense, easier to 
deal with than false positives, and we obtain an improved order 
of convergence on the number of achievable tests in the case 
of false negatives only. 

Another way to describe the false negative process is that 
the test will be positive if the sum of the indicators, thinned 
in the sense of Renyi, is positive. In future work we hope to 
explore whether the bounds on entropy under thinning proved 
in 1 8 1 can improve or generalize the results of this paper. 

We now formally describe the model which incorporates the 
presence of these two kinds of testing errors. 

Definition 1: Let f3 G {0,1}^ be a column vector of 
indicators corresponding to the overall set of items, i.e., ft = 1 
iff item i is defective. We consider the case w(f3) = K <C N , 
where w(-) represents the Hamming weight. Furthermore, 
X = (xu) £ {0, l} TxAr will denote the measurement matrix, 
s.t. xu = 1 iff item i is pooled in test t. We will restrict our 
attention to the case where X is composed of i.i.d. Bern(p) 
entries. A set of test results is a vector y £ {0, 1} T , where 
yt = 1 means test t is positive. The outcome y t of test t is 
given by (symbol A stands for Boolean matrix product): 

yt = (x t AD t A/3)Vz t , t e {1,2,..., T}. (1) 

Here x t denotes the t-th row of X, D t e {0,1} A ' ><JV is a 
diagonal matrix with i.i.d. Bern(l — u) entries on the diagonal, 
independent of (3 and X, and z t is a Bern(q) random variable, 
independent of all others. 

This compact notation captures both the false positive test 
results which occur when z t = 1, and the false negative test 



results which occur in the event that all the diagonal entries of 
D f corresponding to the defective items in pool t equal zero. 

In (T), Atia and Saligrama showed how group testing can 
be viewed analogously to channel coding by considering a set 
of K channels, with input X/^ and the pair (Xik- i) ■> Y) as 
their output, i G {1, 2, . . . , K}. Here, Xu\ stands for the i 
entries in the row of the measurement matrix corresponding 
to (any) i defective items, and Y is the test outcome (viewed 
as a random variable). Atia and Saligrama prove the following 
result: 

Theorem 1: (fTj, Theorem 3.2) Consider the joint typicality 
decoder in the model of Definition Q] in the case where one 
or both of q = or u = (the model is noisefree, or allows 
false positives or false negatives, but not both). An achievable 
number of tests T typ which allows perfect detection is given 

us* 



T typ = max 



1 I(X( i y,X( K _ i - ) ,Y) 



(2) 



We now describe the structure of the remainder of this paper. 
We will consider the model of Definition [1] in the case where 
both q and u can be non-zero (both false positives and false 
negatives are allowed). For reasons of space, we will assume 
that an analogue of Equation (O holds in the case q > and 
u > 0. (To verify this requires a somewhat lengthy analysis of 
the probability that X and Y are jointly typical, as performed 
in the Appendix of ID). This means that the key quantity 
of interest is the mutual information I(X^y, X/^-i), Y). We 
will analyse this quantity in Section [TTJ as in [ 1 ] deducing 
asymptotic results of the form T typ — 0(K log(K(N — K)). 
We also deduce that in the case where only false negatives 
occur, the number of tests required can be reduced by a factor 
of \ogK. 

In Section [TTT1 we propose a belief propagation algorithm 
for the detection of the defective items in noisy group testing. 
The analysis of Theorem Q] is based on the use of a joint 
typicality decoder, which is infeasible in practice, having 
prohibitive computational complexity in the limit of large K 
and N. Belief propagation offers a practically implementable 
alternative. Belief propagation has previously been used in 
the statistical physics community to address the problem of 
noiseless group testing and its relationship to the hitting set 
problem J9|. 

II. Asymptotic Bounds 

In this section, we will derive sharp bounds on the mutual 
information of Equation (0: 

Lemma 1: The mutual information I(Xuy, X(K—i)i Y) can 
be expressed in closed form as 2j + 12, where the "lead term" 
is: 



h = l(l - q){l - p + pu) K 
pu 



1 — p + pu 
and the "error term" is: 



log 2 u - log 2 (l - p + pu) 



(3) 



h = 



1 OO 



log 2 



J=2 



(1 - qV 

ju-iy 



{l-p + pu^fil- 



(1 — p+puy 



(4) 



1 — p + pu° J 
Proof: As in (TJ, we decompose 

I{X {i) ;X {K _ i) ,Y)=H{Y\X {K _ i) )-H{Y\X {K) ), (5) 

and consider the two terms separately. First, we set V = X A 
DA/?, and notice that 

V(Y = 0\w(X (K) )=j) = 

V(Z = 0)¥(V = 0\w(X {K) )=j) = 

{l-q)u j , (6) 

which means that, writing h(-) for the binary entropy function, 

H{Y\X {K) ) = £ . )pi(l-p) K -*h [(1-qW] . (7) 



3=0 



Similarly, 



»(Y = 0\w(X {K _ i) ) = l) = 

V(Z = 0)V(V = 0\w(X (K _ i) ) = l) = 

l+i 

(l-q)J2[nV = 0\w(X {K) )=j). 

3=1 

W(w(X {K) )=j\w(X [K _ i) ) = l) 
l+i 

(l- q )J2n j Hw(X i{) )=j-l) = 
3=1 



3=0 



(l-q)u\l-p + pu)\ (8) 



whereby we obtain: 



K-i 



H{Y\X {K ^) = 



1=0 L 



K-i 



p\l-p) 



h((l- q)u\l-p + pu) 1 ) 



(9) 



We substitute the expressions and (O into Equation (0, 
and analyse the resulting sum. We use an expansion of binary 
entropy as 



h{6) = {-e\og 2 6) + 



0J 



^ . ; . • (10) 



log 2 \ friiU-i) 



with the first bracketed term becoming (01, and the remaining 
expression becoming ( fT8l ). ■ 
Observe that for any j > 2, the function g(p) = (1— p+pu- 7 ) — 
(1 —p + pv,y > 0. This means that the bracketed term in <TT~8T > 
is positive, and so as in fT), we could simply use the lower 
bound I(X(i) ; Xix-i) > Y) > I\ m Theorem Q] However, in 
many cases I 2 turns out to play a significant role, and so by 
including it in our analysis we obtain better bounds. 

Lemma 2: Choosing p = (1 — u)^ 1 / K, we obtain that for 
constant i, q and u 



h 



i(l — u)(l — q)(u\ogu — u+1) 
ifelog2 



O 



K 2 



(11) 



Proof: By setting p = a/K, and expanding Equation (0 
in powers of 1/K , we obtain 



m(l - g)e a ( u_1 )(ttlogtJ - u + 
ATelog2 



X 2 



(12) 



We can optimize this expression over a by taking a = 1/(1 — 
u), which justifies the heuristic choice of p = 1/K to define 
the measurement matrices in (TJ. ■ 
Note that in the cases u = q = and u = respectively we 
recover i/(Ke log 2) from (15) of [1] and i(l - q)/{Ke log 2) 
from (29) of (H. In the case q = 0, this optimal choice of 
a gives us a lower bound of i(l — u)/(2Kelog2), a slight 
improvement on (37) of Q). 

Similarly, it can be shown that with p = a/K, 



lim KI 2 = 

K^oo log 2 



E 

i=2 



(1 ~ g) j 

j"0" - 1) 



(13) 



It is easy to see that the series in (fT3l l is converging for g 7^ 0. 
Furthermore, by repeatedly using the sum 

9 j 



]T = d+(l-fl)bg(l-e), (14) 

and the obvious inequality < u 3 < u 2 , for j > 2, we obtain: 



ae 



log 2 



■C q>u < lim X7 2 < 



log 2 



(15) 



where 



Cg,« =?-(l-u + q'u)(l+logg-log(l-u + q'u)). (16) 

Notice that C g u = 00 when q = 0. This suggests that 7 2 is 
of a larger order in this case. Indeed, the following Lemma 
holds: 

Lemma 3: In case q = 0, I2 = 0( In particular, 



f?l-!l(l_u)< lim -^-/ 2 < ^^ e ™ 2 (l- u ). (17) 
log2 V ' ~ K^logK log2 V ' ' 

Proof: In 



/a = 



log 2^ 



(1 -p + pnJ)* 

j'O' - 1) 



(1 -p + pu) j 
1 — p + pu- 7 



(18) 



we notice that (1 -p + pu j ) K f e~ a(1 "" J) < e ~«( 1 - l ' 2 ) ; as 
K — > 00, whereas (1 — p + pvj) K > (1 — p) K . Therefore, by 
applying ( [Pfl ) with 8 = (1 — p + pu) 1 , we obtain that Mi. K, 



(1-P) K 
log 2 



_ (1 -p + pu) 1 
(i- P y 



(1 - (1 - p + pu) 1 ) log(l - (1 - p + pu) 1 ) 



(i- P y 

<h< 



< 



< 



-a{l-u 2 ) 

log 2 



1 



(1 -p + puf 
(1 — p + pu 2 )" 1 



(1 - (1 - p + pu)') log(l - (1 - p + ptt)') 
(1 — p + pu 2 ) 1 

Now, by developing both sides in powers of 1/K, 

ae~ a i 



■ (19) 



log 2 

(1 — u) [logK — log(m — aiu)] — u 



K 



< 



ae - a (l-u 2 ) i 



log 2 



(1 — u) [log K — log(cu — aiu)] — u + u 



K 



O 



K 2 



(20) 



which proves the claim. ■ 
Theorem 2: Assuming that an equivalent of Theorem Q] 
holds in the general case, using Lemmas [T]|2] and |3] we deduce: 

(i) For any q > 0, u > 0, as K — > 00, 



T typ = 0{Klog(K{N-K)). 



(21) 



In particular, 
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Figure 1. The bounds on the constant in the asymptotic estimate Tt yp 
0(K \og(K(N - K)) for u = 0.05, q = 0.01. 



ae- a C q , u < 



lim 

K^oa 



Klog(K(N- K)) 



T, 



typ 



< 



_g)( n lo gu - n +l) 

ae —(i-^) Cqu _ (22) 

(ii) For q = 0, and any u > 0, as K — > oo, 



log x 



(23) 



In particular, 

ae~ Q (l -u) < 



a-^oo r ty p log if 

-a(l-u 2 ) 



(1-u). (24) 



Notice that in the case of noiseless group testing, i.e., u = 0, 
q = 0, we arrive at the exact asymptotic expressions for T typ : 



r typ = eK(l+ lQgK ). 



(25) 



In the noisy case, the derived bounds are sharp. Fig. Q] 
depicts the quantities bounding MmK-^oo Kla %(K{N-K)) a s a 
function of a for u — 0.05 and q — 0.01. The bounds coincide 
in the first two decimal places. 

III. Belief Propagation Reconstruction 

The joint typicality decoder analysed in Section [TT] has 
prohibitive computational complexity in the limit of large K 
and AT. In this section, we compare the theoretical performance 
with the performance of belief propagation (BP) decoder, 
which performs an approximate bitwise MAP (maximum a 
posteriori) detection of defective items by solving: 



\{MAP) 
i 

The above can be transformed into: 



% MMr > = arg max P(ft \y), » € {1, 2, ... , N}. (26) 

ft6{0,l} 



\{MAP) 



rff max > 

x E 



N 



T 



3=1 



Y[P(yt\w((3 suppiyt) )) 



(27) 



= arg max 

L t=l 

N 

n(A<5 ft (l) + (l-A)^.(0)) 

3=1 

where A = K/N. Therefore, MAP detection amounts to the 
marginalisation of a function which permits a sparse factori- 
sation, and as such can be performed efficiently via message 
passing on a factor graph corresponding to the measurement 
matrix X. 

The belief propagation message-update rules are given by: 

m^^«(^ ft (l) + (l-A)^(0))- 

J] m^(ft), (28) 

&£jV(»)\{t} 



,(') 



P(ytk(^ a «pp(»t))) - 

n 



(29) 



jeAT(t)\{i} 

The fact that P(yt\(3 S upp(y t )) = P(yt\w(/3 supp{yt) )) greatly 
simplifies the message-passing update rules. In particular, 
since F(y t \w(/3 supp{m) )) = (1 - q)u w( - p '^'^ due to the 
symmetry between x t and f3 in (Q3, the above equations, by 
rewriting message-update rules in terms of log-ratios, i.e., 



simplify to: 
and 



log 7 l j 



— log ^(0)- (30) 



f (0 

&eJV(j)\{*} 6-H' 



Z = 0, 

z > 1, 



(31) 



log 



1 - u 



v i-a-^n.wwui} 



i+cx P (L^ t ) y / 



(32) 
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Figure 2. Probability of perfect reconstruction with BP at N = 5000, 
K = 50. 

in the case of a positive t-th test, i.e., when y t = 1, and simply 

L ( X = Iogt», (33) 

for y t = 0. 

In a preliminary assessment of the belief propagation recon- 
struction, we simulated BP decoder for noisy group testing in 
the case where N = 5000, K = 50, u = 0.05, q = 0.01, and 
for various values of parameter p. We performed at least 200 
trials at the various numbers of tests. The number of iterations 
was fixed to 50. 

As illustrated in Fig. [2] the detected probability of perfect 
reconstruction increases with p, and is about 99% when the 
number of tests was T ps 1600 for p = 0.02. The value of 
p which performs best here is 1/K, suggesting that the same 
heuristics concerning the optimal p discussed for typical set 
decoding also apply for belief propagation. 

In Fig. [3] we illustrated the number of detection errors 
per size of the support as a function of the number of tests. 
This figure illustrates that even though a large probability of 
perfect reconstruction is achieved only at the relatively large 
number of tests, the BP decoder typically diagnoses only 
a few items incorrectly at the number of tests as small as 
T ps 900. These results are still far from the estimate arising 
from the asymptotic analysis of joint typicality decoder in 
the previous section, which is Tt yp ps 400, but nonetheless 
confirm the utility of the belief propagation algorithm in noisy 
group testing, even though no design of the measurement 
matrix that complies well with belief propagation algorithm 
has been taken into consideration. It may also be possible to 
achieve further improvements in performance by using belief 
propagation with decimation as in (9). 

IV. Conclusions 

This contribution studies the information theoretic bounds 
arising in the problem of noisy group testing and proposes 
an efficient algorithm for noisy group testing based on belief 
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Figure 3. The number of detection errors per size of the support with BP 
at N = 5000, K = 50. 

propagation. We develop a sharp estimate on the constants 
arising in the asymptotic approximation of the number of 
tests sufficient for the perfect detection via a joint typicality 
decoder, as a function of the noise parameters. We show 
how the presence of the false positives in the noisy group 
testing changes the order of the achievable number of tests. 
These result allows us to benchmark the performance of a 
belief propagation algorithm. We restrict our attention here 
to the case where the measurement matrix is composed of 
i.i.d. Bernoulli entries. More general measurement matrices 
can be studied in a similar manner, in particular those with 
row weights generated according to a pre-optimised degree 
distribution. A judicious choice of degree distributions may 
further improve the performance of the belief propagation 
algorithm, in analogy with well known results in sparse graph 
coding. 
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